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ABSTRACT 

We have developed SMMRNA, an interactive data- 
base, available at http://www.smmrna.org, with 
special focus on small molecule ligands targeting 
RNA. Currently, SMMRNA consists of - 770 unique 
ligands along with structural images of RNA mol- 
ecules. Each ligand in the SMMRNA contains infor- 
mation such as Kd, Ki, IC50, ATm, molecular weight 
(MW), hydrogen donor and acceptor count, XlogP, 
number of rotatable bonds, number of aromatic 
rings and 2D and 3D structures. These parameters 
can be explored using text search, advanced 
search, substructure and similarity-based analysis 
tools that are embedded in SMMRNA. A structure 
editor is provided for 3D visualization of ligands. 
Advance analysis can be performed using substruc- 
ture and OpenBabel-based chemical similarity fin- 
gerprints. Upload facility for both RNA and ligands 
is also provided. The physicochemical properties of 
the ligands were further examined using OpenBabel 
descriptors, hierarchical clustering, binning parti- 
tion and multidimensional scaling. We have also 
generated a 3D conformation database of ligands 
to support the structure and ligand-based scre- 
ening. SMMRNA provides comprehensive resource 
for further design, development and refinement of 
small molecule modulators for selective targeting 
of RNA molecules. 

INTRODUCTION 

Recently, RNAs have been unraveled as unique molecules 
playing critical roles in developmental and physiological 
processes in all living organisms (1-5). RNA is involved 
in the progression of diseases such as infectious diseases 
(6-8) (e.g. HIV, AIDS, hepatitis C), metabolic diseases 
(9,10) (e.g. diabetes, cancer) and triplet repeat disorders 



(e.g. myotonic dystrophy, Huntington's disease) (11-21). 
Druggability of RNAs has been validated in high-profile 
targets, for instance, ribosomal RNA (rRNA) could be 
targeted using aminoglycosides, macrolide, tetracycline 
and oxazodinone (22-26). There are clinically approved 
antibiotics (e.g. erythromycin), which bind to RNA mol- 
ecules. Structural analysis of antibiotics bound to riboso- 
mal subunits has revealed that rRNA-small molecule 
recognition is mainly governed by electrostatic interaction, 
shape and hydrogen-bonding interactions (27-28). Recent 
success in crystallization of rRNA offers key structural 
information for RNA-based drug design (28^10). 
Diverse roles of RNAs provide numerous opportunities 
for specific targeting and modulating RNAs with small 
molecules. 

Even though RNA adopts a favorable 3D structure, 
targeting of cellular RNA such as messenger RNA and 
microRNA for drug discovery was considered as 
complicated due to difficulty in crystallization, their con- 
formational sampling and lack of RNA-specific computa- 
tional tools. Excitingly, recent advances in biomedical and 
computational fields have provided vital information on 
RNA-small molecule interaction to overcome these issues 
to develop therapeutics for many diseases (41^44). 
Moreover, new developments in nuclear magnetic reson- 
ance and mass spectroscopy have made it possible to 
screen millions of small molecule compounds for finding 
selective inhibitors for specific RNA target (45-56). 

Recently, number of RNA-based molecular targets 
have begun to grow rapidly with detailed elucidation of 
their structural and functional relationship (57-64). Also, 
small molecule inhibitors have been successfully developed 
for various different RNA molecules (65-75). Numerous 
publicly available databases are available providing infor- 
mation about RNA sequence, secondary structure and 3D 
RNA structure (76-86). To the best of our knowledge, 
there is no database that focuses on small molecule modu- 
lators along with their target RNA and experimentally 
determined binding data (K d , K ;> IC 50 , ATm) of the cor- 
responding RNA-inhibitor complex. The experimental 



*To whom correspondence should be addressed. Tel: +91 172 6665488; Fax: +91 172 2690585; Email: rparkesh(5 imtech.res.in 
The authors wish it to be known that, in their opinion, the first two authors should be regarded as Joint First Authors. 
© The Author(s) 2013. Published by Oxford University Press. 

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ 
by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial 
re-use, please contactjournals.permissions@oup.com 



Nucleic Acids Research, 2014, Vol. 42, Database issue D133 



data published in the literature are unstructured and dif- 
ficult to navigate and as a result challenging to perform 
structure activity prediction. Moreover, chemical struc- 
tures are depicted as ChemDraw images and are thus 
not searchable. The process of discovery of novel ligands 
to target RNA would be greatly facilitated by the avail- 
ability of a small molecule database that is RNA-specific. 
In this regard, it would be highly valuable to collocate, 
organize and integrate RNA and their modulators along 
with experimental data in a publicly available domain that 
can be effectively navigated. SMMRNA is the first step in 
this direction. SMMRNA would facilitate the investiga- 
tion of structure activity relationship between RNA and 
ligands, clustering of small molecules targeting RNA, 
provide information for fragment-based drug design and 
also aid in the quantitative structure activity relationship 
modeling of RNA. 

MATERIALS AND METHODS 

Data collection 

The structural and experimental data were manually col- 
lected from peer-reviewed journals such as Biochemistry, 
Journal of American Chemical Society, Nature, Science 
and Journal of Molecular Biology. The relevant articles 
were selected from Pubmed, Web of Science and Google 
Scholar using keyword searches such as 'small molecule 
targeting of RNA', 'small molecule RNA inhibitor' and 
'targeting RNA with small molecules'. The details of 
RNA secondary structure, ligand structure, binding 
constant, IC 50 , binding mode and assay performed were 
gathered from each article. The references provided within 
the articles were consulted to get further information 
about RNA secondary structures, ligand structures, 
experimental binding affinities and binding mode of 
various ligands. RNA structures were drawn as images 
using Adobe Photoshop (v4). Structures of ligands were 
drawn using ChemBioDraw Ultra (vll.O) and saved as sdf 
file format. In many cases, the chemical structure of ligand 
molecules, drawn as scaffold and abbreviated with 
i?-group substituent in the published articles, was 
redrawn in computer-readable form. Before importing 
these structures into the database, structures were 
manually checked for atom valence and correctness of 
representation. 

Database description 

The database is built using Apache HTTP (web server) 
along with MySQL (database server). MySQL RDBMS 
(relational database management system) is used for 
storing data (see the Supplementary File SI for a 
complete description of MySQL database). The general 
layout of SMMRNA showing homepage and search, 
browse, upload, literature and help tools are shown in 
Figure 1. 

Substructure search 

The command line version of VLifeMDS [VLifeMDS: 
Molecular Design Suite, VLife Sciences Technologies 



Pvt. Ltd., Pune, India, 2013 (www.vlifesciences.com)] 
was used for performing a substructure search. The 
main steps involved in substructure search (as imple- 
mented in VLifeMDS) were: (i) Inputs were user-defined, 
consisting of template molecules to be matched against the 
reference molecule. Reference molecules were already 
stored in the database, (ii) Details of the molecules were 
built that include number of atoms (of different atom 
type; H, O, N, CI, F and C), number of bonds (based 
on their types; single, double or triple), number of rings 
(3-, 4-, 5- and 6-member rings) and number of aromatic 
rings, (hi) The first step eliminates the substructure search 
based on the configuration set above. If any one of the 
configurations of template molecule was greater than a 
reference molecule, then no further processing was done. 
For instance, if the number of F atoms in the template was 
3 and the number of F atoms in reference was 2, then 
obviously the template substructure did not match with 
that of the reference, (iv) An atom from template mol- 
ecules was selected (this could be the atom with 
maximum number of connections). Next, the graph of ref- 
erence molecule was traversed to find atom with the same 
degree and type as the template atom, (v) Each atom of 
the template was traversed recursively in a similar manner. 
If at any stage, no matching template atom was found in 
the reference molecule, the search was terminated. If all 
the atoms of the template molecule were matched with the 
reference molecule, the search was deemed successful. 

2D structure editor for structure search 

VLife2Draw (87) was used as an editor and viewer for 
searching 2D molecular structures in the SMMRNA 
database (Figure 3A). This is a Java applet and can be 
run in a browser with a suitable Java plugin. VLife2Draw 
allows the user to draw chemical structures, and to import 
and export these structures in molecule file formats (mol 
and sdf are supported formats). VLife2Draw provides 
basic chemical structure editing facility to draw bonds 
like single-, double-, triple-, stereo-up and stereo-down 
bonds. Templates for drawing 3-8-membered ring struc- 
tures were also provided. 

VLife2Draw applet runs on the client side, where a user 
can draw a structure that needs to be searched. After 
submitting the structure, applet converts these data into 
an sdf file format string, which is then transferred to the 
server. This information is captured by the server and 
stored in a temporary file. A Python script, written using 
VLifeMDS scripting framework, then uses this template 
molecule to search the SMMRNA database for ligand 
molecules that satisfy the structural similarity as indicated 
in the substructure search methodology. 

Descriptor calculation 

The command line version of VLifeMDS was used for 
calculating descriptors for ligands (Figure 2C). The de- 
scriptors computed by VLifeMDS are shown in Table 1. 

3D structure viewing 

The SMMRNA database contains ligand entries as 2D 
coordinates. For 3D depictions, the 2D coordinates are 
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Figure 1. Homepage of SMMRNA database showing various analysis functionalities such as search, browse, upload, etc. 



first curated for errors followed by the addition of appro- 
priate number of hydrogen atoms. The structures are sub- 
sequently optimized using Merck molecular force field 
(MMFF), using a loose gradient cutoff (1.0 kcal/mol/ang- 
stroms). The optimized structures were further processed 
and stored into the database in a separate field in the 
MOL2 file format. JMol molecular viewer was used for 
visualization of 3D structures of ligands (Jmol: an open- 
source Java viewer for chemical structures in 3D. http:// 
www.jmol.org/). A standard ball-and-stick model is used 
to display the ligands (Figure 4A). Additionally, other 
structural displays such as Corey-Pauling-Koltun (CPK) 
spacefill, sticks and wireframe representation are possible 
by right clicking (using mouse or touchpad) and then se- 
lecting the renderMenu from the JMol dialog box (Figure 
4B). It is also possible to show various molecular surfaces 
such as Van der Waals, solvent accessible, dots, charge, etc 
(Figure 4C). CPK coloring scheme of atoms is used to 
depict the atom types; with the prominent atom types 
listed as Carbon-dark gray, Hydrogen-light gray, 
Oxygen-red and Nitrogen-green (92,93). 

Clustering analysis 

Hierarchical, binning and multidimensional cluster 
analysis of the database compounds were performed 
using the Chemmine software (http://chemmine.ucr.edu/) 
(94). 



3D conformer generation 

High-quality 3D conformers were generated using 
OMEGAv2.5 from openeye (http://www.eyesopen.com/) 
(95). Omega uses model building and torsional search 
for efficient conformational sampling. In the first step of 
model building, the chemical structure is fragmented 
along sigma bonds using the makefraglib utility of 
OMEGAv2.5, and then the structure is reconstructed by 
assembly of these fragments. These fragments were refined 
using modified MMFF94. In the second step, an ensemble 
of conformers was generated using torsional angle rules as 
described. For each molecule, 100 conformations were 
generated with energy cutoff of 10 kcal/mol. The conform- 
ational searches were terminated when the energy cutoff 
exceeded 10 kcal/mol or when the 100 structures were built 
(configuration parameter file is described in the 
Supplementary File S2). 

RESULTS AND DISCUSSION 

Organization and data retrieval of SMMRNA 

SMMRNA database is accessible via a user-friendly 
graphical user interface (GUI) at the web address www. 
smmrna.org. The interface allows textual search for both 
RNA and ligand (Figure 2), substructure and fingerprint 
search for ligands (Figure 3). In addition, browse facility is 
also provided to quickly locate ligands and RNA. The 
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Figure 2. Examples of text and advanced property-based search in SMMRNA. (A) Screenshot showing dialog box for RNA and ligand text-based 
search. (B) Screenshot showing results based on RNA text search, e.g. using yeast t-RNA as text string, the information about the RNA and various 
ligands that target this particular RNA is depicted. (C) Screenshot showing dialog box for ligand search based on advanced properties such as MW, 
IC50, hydrogen bond donor, acceptor count, etc. (D) Screenshot showing results based on MW cutoff of <500. 



database also allows ligand-based substructure and finger- 
print search (Figure 3). The ligands can be searched by 
their common or IUPAC names. For a molecule, this 
search would retrieve all the RNAs where the molecule 
acts as modulator together with related experimental 
data. Additionally for each compound, 2D and 3D struc- 
tures, along with other calculated properties, appear in the 
drop down menu. Both 2D and 3D structures of the ligand 
can be downloaded (Figure 4D). For 3D representations 
of ligands, 2D structure was first corrected for errors, and 
then an appropriate number of hydrogen atoms were 
added. These 3D structures are energy minimized with 
MMFFs using gradient convergence of 1.0kcal/(molA). 
The energy-optimized structures are further processed 
and stored into the database in a separate field in the 
MOL2 file format. 

Online tools for structure and similarity search 

We have implemented many web-based tools for struc- 
ture-based search and the similarity analysis of various 
molecules reported in the SMMRNA. These tools are as 
described later in the text. 



Advanced search 

Under the search menu, an advanced tab is inserted to 
explore ligands using physicochemical properties such as 
molecular weight, XlogP and number of rotatable bonds 
(Figure 2C). The data generated can be filtered using a 
specific property or a combination of properties such as 
acceptor count, XlogP, etc. Each property could be 
controlled by an operator (greater than, less than or 
range) and a subsequent value for filtering. Users can 
further limit their search by applying a filter based on 
RNA names, if they are interested only in a single RNA 
or a class of RNA molecules. 

Substructure search 

A structure-based search could be easily performed by 
using powerful editors embedded within the web page. 
The user can draw a complete or partial structure of the 
query ligand to retrieve similar compounds (Figure 3). 
VLife2Draw applet runs on the client side, where a user 
can draw a structure that needs to be searched. After 
submitting the structure, applet converts these data into 
an sdf format string, which is then transferred to the 
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Table 1. Various descriptors along with their description and significance used for advanced search criteria 



Descriptor 



Comments 



Acceptor count Number of hydrogen bond acceptors in the molecule (88). 

Donor count Number of hydrogen bond donors in the molecule (88). 

Rotatable bonds Provides information about conformation sampling of a molecule, which is useful in docking This is important in docking 

where a particular conformation might show favorable binding. 
Aromatic rings Number of aromatic rings in a molecule. Generally, the presence of aromatic rings in a molecule result in the 

possibility of 71-71 interactions prevalent in binding sites. 
XlogP Distribution-coefficient of the inhibitors, drug like properties and lipophilic efficiency (89-91). 
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Figure 3. Examples of substructure and chemical similarity-based search in SMMRNA. Screenshots of (A) substructure query, (B) similarity search 
and (C and D) the data obtained using substructure and similarity-based search, respectively. 



server. The server side script captures this information and 
stores it into a temporary file. A Python script, written 
using VLifeMDS scripting framework, then uses this 
template molecule to search the SMMRNA database for 
ligand molecules that satisfy the structural similarity as 
indicated in the substructure search methodology. The 
steps implemented in VLifeMDS for performing a 



substructure search are as follows: (i) user-defined input 
to be matched against the database molecule; (ii) molecule 
generation using atoms (H, O, N and C), bonds (single, 
double and triple) and carbocyclic, heterocyclic and 
aromatic rings of different sizes (3-, 4-, 5-, 6-membered); 
(iii) atom template matching between the input and the 
database reference molecules; and (iv) graph matching 
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between input and reference molecule to establish con- 
nectivity. If there is a match between atom template and 
graph connectivity between input and reference molecule, 
the search would return with the matching list of struc- 
tures. Graph matching is based on graph theory, which 
involves modeling the pairwise similarity between two 
molecules. In VLifeMDS, molecules are represented by 
graphs, which consist of vertices (atom) and edges (bond 
connecting atoms). Vertices are pictorially represented as 
'dots' or 'circles', whereas the edges are represented by 
drawing an arc. Mathematically, a graph can be repre- 
sented by a matrix of atoms and bonds, and each entry 
(atom, bonds) contains the bond endpoint data. For sub- 
structure search matching, all it involves is to determine 
which graphs in a database are subgraph isomorphic to a 
query graph (see the Supplementary File S3 for illustration 
of graph matching). 



Similarity search 

Ligand similarity search of SMMRNA is quite versatile, 
as it has the capability to identify as well as rank the 
structures based on the query. A fingerprint represents a 
predefined fragment or feature found in a molecule. The 
fingerprint search uses a binary fingerprint for the chemical 
structure. A fingerprint is a series of binary (1/0) bits that 
are arranged in sequence. Each bit position corresponds to 
certain information such as the presence or absence of an 
atom, type of ring, element count and a substructure. 
During a substructure similarity search, a fingerprint of 
the query is computed initially. Tanimoto coefficient is 
based on the fingerprints, which are binary vectors, with 
a value of 1 and 0, with 1 indicating the 'presence of and 0 
indicating the 'absence of fingerprints. Tanimoto coeffi- 
cient quantifies the similarity between query and database 
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molecules and has a similarity score range from 0.0 to 1.0. 
For computing Tanimoto coefficient, each ligand in the 
SMMRNA has a pre-computed fingerprint incorporated. 
The generated fingerprint is then matched with every entry 
in the database, and a similarity score, which lies between 
0.0 and 1.0, is computed. The structures are sorted in des- 
cending order relative to similarity score, with the top hits 
representing the most similar structure to the search query 
(Figure 3D). SMMRNA uses OpenBabel fingerprints 
(http://openbabel.org/) search for similarity analysis (96). 

Significance of the database 

We have reviewed >900 original research articles and col- 
lected all the relevant information. In SMMRNA, we have 
not included the PubChem Bioassay data to avoid dupli- 
cation and problems about the credibility and validation 
of high-throughput screening (HTS) data. The SMMRNA 
database contains unique molecules and includes only 
those molecules for which the experimental data are 
reported in the peer-reviewed literature. In few articles, 
experimental parameters were not reported, although the 
binding to RNA molecules was confirmed and validated 
with various spectroscopic and other binding techniques 
such as gel shift and radioactivity assay. We have included 
these molecules and annotated comments and references 
to highlight these observations so that the user can easily 
verify and cross validate. 

The SMMRNA database is very diverse and includes 
molecular classes such as antibiotics, peptides, amino 
acids, modified amino acids, dyes, nucleotides, 
polycations, intercalators, inorganic and organometallic 
compounds, which are unique to this database (see the 
Supplementary File S4). One particular observation is 
that many antibiotics act as ligands for RNA. Literature 
analysis reveals (97) that there are eight important clinic- 
ally approved antibiotics, whose mechanism of action 
involves RNA binding, and the research and development 
efforts on antibiotics in recent years have severely 
declined. The data provided here are appealing for antibi- 
otics research and would aid in the design of improved or 
completely new antibiotics (using various computational 
methods). 

Chemical space analysis of database molecules 

RNA is a highly charged molecule and possesses a high 
degree of conformational flexibility. Thus, it is important 
in the early drug discovery stages to predict and optimize 
the favorable physicochemical properties of ligands to 
facilitate their efficient binding to RNA molecules. To 
understand whether various filters that are commonly 
applied to other biological targets are effective, we 
analyzed the molecules in the SMMRNA database using 
standard filters such as Lipinski Rule of Five, ZINC and 
PAINS filter (98-100). ZINC filter is less restricted 
compared with other filters and uses relaxed cutoff value 
of various physicochemical parameters such as molecular 
weight, hydrogen bond donor and acceptor count, logP 
and number of rotatable bonds (see the Supplementary 
File S5 for ZINC filters used). About two-third of the 
molecules failed to pass these filters. These results are 



important in understanding the chemical space, which 
we should focus in developing ligands for targeting 
RNA. For targeting RNA, specific physicochemical par- 
ameters such as more hydrophilicity, lower lipophilicity 
(especially for developing new antibiotics), high molecular 
weight and larger polar surface area are required. Most of 
the ligands that bind to RNA are highly charged, e.g. 
polycation derivatives and aminoglycosides. However, 
there are small drug-like molecules such as DAPI, etc, 
which can be optimized to bind RNA with varying 
binding affinity. It is thus possible to develop lead-like 
molecules to target RNA by modification of these mol- 
ecules using organic synthesis. From the analysis of the 
molecules in SMMRNA, it is clear that we need to 
develop separate filter parameters for analyzing ligands 
targeting RNA. In particular, we need to be flexible in 
terms of hydrogen bond donor, hydrogen bond 
acceptor, number of rotatable bonds, ring size to design 
and synthesize diverse and optimized ligands of higher 
selectivity and sensitivity. Also, majority of RNA target- 
ing molecules have significantly higher topological polar 
surface area as compared with the drug-like molecules, 
where the value is around <140A 2 . These results are 
highly important, as recent reviews (97) have highlighted 
the need to develop entirely different physicochemical 
parameters for targeting RNA. 

Clustering analysis 

Clustering is an important tool for medicinal or computa- 
tional chemist and is widely used in application ranging 
from drug discovery to lead optimization (101,102). 
Clustering techniques are used to identify molecule scaf- 
folds that share common structural properties (102). It is 
widely accepted that structurally similar molecules will 
show similar biological activity profile (103). ChemMine 
tools for clustering analysis include hierarchical clustering, 
multidimensional scaling (MDS) and binning clustering. 
We have used these clustering tools to identify common 
scaffolds based on structural and molecular descriptors. 
This will lead to the identification of potential molecular 
entities for structure activity relationship. We have per- 
formed hierarchical, binning and MDS to identify and 
visualize the chemical space of ligand molecules. 

We have performed agglomerative hierarchical cluster- 
ing method, which involves a progressive combination of 
the most similar molecules into related groups, and the 
results are shown as a dendrogram, highlighting the rela- 
tionship between clusters. It has the advantage that any 
number of clusters can be selected. However, it is difficult 
to visualize the chemical space covered by database mol- 
ecules. For visualization of chemical space, we performed 
binning partitioning clustering and MDS. Bin-based par- 
titioning methods are useful for the identification of 
underrepresented chemical space, which can be fulfilled 
by synthesizing more compounds. In bin-based partition- 
ing method, we subdivided multidimensional chemical 
space into various bins using a Tanimoto similarity 
cutoff of 0.4. Compounds falling into the same bin 
volumes are deemed to have similar chemical properties. 
Bin-based partitioning of database compounds, which 
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passed the ZINC filter resulted in the creation of bins with 
variable bin sizes. We observed many bin clusters with 
sizable number of similar molecules. For example, there 
is an interesting cluster of antibiotics that target RNA 
molecules. We observed a number of bin clusters repre- 
sented by just single molecular scaffold. We believe that 
greater medicinal chemistry effort should be made to 
expand the compounds representing these particular 
scaffolds. 

3D conformation 

For high-throughput virtual screening, such as ligand and 
structure-based drug design, we generated a multicon- 
former database of 647 compounds using MMFF94. 
For each compound, 100 conformers were generated 
specifying the energy cutoff of lOkcal/mol and root- 
mean-square deviation of 0.5. For some compounds, the 
number of conformers generated was less than the 
specified as a result of energy and root-mean-square devi- 
ation criteria. Around 128 compounds failed as a result of 
the criteria specified. We believe this multiconformer 
database (with 33 866 conformations, see the 
Supplementary File S6) will be very useful, as it provides 
the structural information about the conformational states 
of all the compounds, which is vital for in silico drug 
designing. 

CONCLUSIONS AND FUTURE PROSPECTS 

We have developed SMMRNA (www.smmrna.org), a 
chemoinformatics platform to facilitate interactive 
exploring of RNA molecules and their modulators. The 
database comprises structural images of RNA, chemical 
structure of ligands, related experimental data (K d , K ; , 
IC50, ATm values), 3D conformers and chemoinformatic 
information. The database can be browsed using RNA 
names, ligand structures, substructures and fingerprint- 
based chemical similarity search (see the Supplementary 
File S7, which shows the video tutorial for navigating 
SMMRNA database). SMMRNA would facilitate struc- 
ture activity relationship studies, statistical analysis, frag- 
ment-based drug design, virtual screening and molecular 
docking studies to assist in developing RNA-based small 
molecule modulators (for comparison with other data- 
bases see the Supplementary File S8). In general, this 
publicly available database would be beneficial for the 
design and discovery of modulators targeting RNA- 
mediated diseases such as diabetes, neurodegenerative dis- 
orders and cardiovascular disorders. We are continually 
reviewing the older literature, patents, etc, so as to include 
additional experimentally validated ligand entries. We 
have also provided the upload facility for both ligand 
and RNA entries for other researchers so that 
SMMRNA will continue to grow and thus provide com- 
prehensive RNA-ligand data. In future versions, we would 
include RNA structure drawing tool, RNA alignment 
tool, fragment generation capability, 3D structure com- 
parison, automatic docking protocol and principal com- 
ponent analysis. The main aim of the database is to 
provide computational and medicinal chemistry tools for 



research community who are interested in medicinal chem- 
istry, biology and biochemistry of RNA. We believe that 
the set of tools provided in SMMRNA would invite sci- 
entists from diverse backgrounds to initiate drug design 
and development of small molecule modulators for target- 
ing various RNA molecules. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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